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There is no more impressive form of literature than the narrative epic 
poem. That combination of depth and breadth of conception which some have 
called sublimity has here found a natural and adequate expression. The theory 
of number is the epic poem of mathematics. The mutual reflection of the two 
arts will supply a sort of explanation in the intellectual dimension of the epic 
quality in both. . . 

But the question, what is a number? is an invitation to analyze counting 
and to find out what sort of thing makes counting possible. In poetry I suppose 
the corresponding question would be, what makes recounting possible? The 
answer, if it were to be complete, would take us into the most abstract and 
subtle mathematical thought. But the key to the problem is the simplest sort 
of insight. The same peculiar combination of simplicity and subtlety is involved 
in the theory of narrative, but as everyone knows, insight here belongs to the 
most common of common-sense conceptions. 

— Scott Buchanan, Poetry and Mathematics [H p. 64] 
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o. Numbers and sets 

The set N of natural numbers is described by the so-called Peano Axioms [28]. 
These refer to the following three features of N. 

1. It has a distinguished initial element, denoted by 1 or 0, depending on the 
writer. 

2. It has a distinguished operation of succession, which converts an element into 
its successor. The successor of x is often denoted by x + 1, though I shall also 
use s(x) and x s . 

3. In particular, the operation of succession takes a single argument (unlike addition 
or multiplication, which takes two arguments). 

The class V of all setfQ also has three features. I shall develop an analogy between the 
three features of N listed above and the following three features of V. 

1. It has a distinguished element, the empty set, denoted by or 0. 

2. It has one kind or 'type' of non-empty set. This is by the Extension Axiom, 
according to which a set is determined solely by its elements (or lack of elements). 
If sets are considered as boxes of contents, then the contents may vary, but the 
boxes themselves are always cardboard, say, — never plastic or steel. 

3. Each non-empty set has a single kind or 'grade' of element; the elements are not 
sorted into multiple compartments as in an angler's tackle-box. 

Von Neumann's definition of the ordinal numbers [42] gives us, in particular, the 
natural numbers. The von Neumann definition of the natural numbers can be seen as a 
construction of N within V by means of the analogy between N and V just suggested. In 
general, N can be understood as a free algebra whose signature comprises the constant 
(or miliary function-symbol) 1 and the singular^ function-symbol s. Von Neumann's 
definition gives us the particular free algebra denoted by w, in which 1 is interpreted as 
0, and s is interpreted as the function x 1— ¥ xU {x}. A similar set-theoretic construction 
will give us a free algebra in an arbitrary algebraic signature, as long as there is a type of 
set for each symbol in the signature, and sets having the type of an n-ary symbol have n 
grades of members. There is a corresponding generalization of the notion of an ordinal 
number as well. I work out the details of this construction at the end of this article, 
in Part [3} Meanwhile, in Part [TJ I consider what I hold to be misconceptions about 
numbers. It was these considerations that led to the whole of this paper. In Part [2J I 
review the logical development of numbers from a historical perspective. 



lr That is, hereditary sets, namely sets whose elements are hereditary sets p. 9]. 

2 Although the term 'unary' is commonly used now in this context, I follow Quine, Church [4J n. 29, 
p. 12], and Robinson [29, pp. 19, 23] in preferring the term singulary. The first five Latin distributive 
numbers are singul-, bin-, tern-, quatern-, and quin- [27], and these are apparently sources for the terms 
binary, ternary, quaternary, and quinary. The Latin cardinals are un-, du-, tri-, quattuor, quinque. (The 
hyphens stand for variable case-endings.) 
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Part 1. 

x. Numbers and geometry 

It is worthwhile to pay close attention to the fundamental properties of N. We may 
learn these wrongly in school. We may be taught — rightly — that N has the following 
properties: 

1. It admits proofs by induction: every subset that contains 1 and that contains 
k + 1 whenever it contains k is the whole set. 

2. It is well-ordered: every non-empty subset has a least element. 

3. It admits proofs by 'complete' or 'strong' induction: every subset that 
contains 1 and that contains k + 1 whenever it contains 1, . . . , k is the whole set. 

4. It admits recursive definitions of functions: on it, there is a unique function h 
such that h(l) = a and h(k + 1) = f(h(k)), where / is a given operation on a set 
that has a given element a. 

We may also be taught that these properties are equivalent. But they are not equivalent. 
Indeed, it is 'not even wrong' to say that they are equivalent, since properties [1] and [4] 
are possible properties of algebras in the signature {l,s}, while [2] is a possible property 
of ordered sets, and [3] is a possible property of ordered algebras. 

A well-ordered set without a greatest element can be made into an algebra in the 
signature {l,s} by letting 1 be the least element and letting s(x) be the least element 
that is greater than x; but this algebra need not admit induction in the sense of [1] 
Indeed, an example of such an algebra is the algebra of ordinals that are less than cu + cu. 
Conversely, an algebra in the signature {l,s} that admits induction need not admit an 
ordering < such that x < s(x): finite cyclic groups provide examples. 

The four properties listed above can become equivalent under additional assumptions; 
but these assumptions are not usually made explicit. Such assumptions are made explicit 
below in Theorems [3] and [gj I present these theorems, not as anything new, but as being 
insufficiently recognized. 

Perhaps the real problem is a failure to distinguish between what I shall call naturalistic 
and axiomatic approaches to mathematics. In the former, mathematical objects exist out 
in nature; we note some of their properties, while perhaps overlooking other properties 
that are too obvious to mention. There is nothing wrong with this. Euclid's Elements 
\TT\ fT2"j is naturalistic, and beginning mathematics students would be better served by a 
course of reading Euclid than by a course in analytic geometry or in the abstract concepts 
of sets, relations, functions, and symbolic logic. 

Not all of Euclid's propositions follow logically from his postulates and common no- 
tions. He never claims that they dojl His Proposition 1. 4 establishes the 'side-angle-side' 
condition for congruence of triangles, but the proof relies on 'applying' one triangle to 
another. Perhaps this 'application' is alluded to in his Common Notion 4: 'Things which 
coincide with one another are equal to one another.' Suppose indeed that in triangles 
ABC and DEF in the figure below, sides AB and DE are equal, and AC and DF are 
equal, and angles BAC and EDF are equal. If the one triangle is 'applied' to the other, 
A to D and AB to DE, then B and E will coincide, and also C and F, by a sort of 



3 Heath [TT1 I, pp. 221 f.] discusses the possibility that the the so-called Common Notions in the 
Elements are a later interpolation. 
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converse to Common Notion 4 that Euclid does not make explicit: things that are equal 
can be made to coincide]! Then BC and EF coincide; indeed, as observed in a remark 
that is bracketed in [tt] . but omitted in fjj], if the two straight lines do not coincide, then 
'two straight lines will enclose a space: which is impossible.' Again, this is a principle so 
obvious as not to be worth treating postulate. 

Without any explicit postulates, Euclid proves his number-theoretic propositions, from 
the Euclidean algorithm (VII. 1-2) to the perfectness of the product of 2 n and 1 + 2 + 

4 H h 2 n if the latter is prime (IX.36). 

Hilbert's approach in The Foundations of Geometry |i8| is axiomatic. Hilbert himself 
seems to see his project as the same as Euclid's: 

Geometry, like arithmetic, requires for its logical development only a small num- 
ber of simple, fundamental principles. These fundamental principles are called 
the axioms of geometry. The choice of the axioms and the investigation of their 
relations to one another is a problem which, since the time of Euclid, has been 
discussed in numerous excellent memoirs to be found in the mathematical lit- 
erature. This problem is tantamount to the logical analysis of our intuition of 
space. 

The following investigation is a new attempt to choose for geometry a simple 
and complete set of independent axioms and to deduce from these the most 
important geometrical theorems in such a manner as to bring out as clearly as 
possible the significance of the different groups of axioms and the scope of the 
conclusions to be derived from the individual axioms. 

However, it is not at all clear that Euclid is concerned with relations among axioms. 
Proving the independence of one axiom from others usually means showing that the lat- 
ter have a model in which the former is false. Hilbert can do this, but Euclid makes 
no suggestion of different models of his postulates. Euclid makes observations and de- 
ductions about the world — the yfj of YE"^£Tpta@ By constrast, while Hilbert 's axioms 

4 Such coinciding, in the case of plane figures, may involve cutting and rearranging, as in the series of 
propositions that begins with 1. 35: parallelograms with the same base in the same parallels are equal. 



5 Euclid does not discuss his work, and in particular he does not use the word yeay.£T:p'ia. Herodotus 
does use the word, a century or two earlier: 

For this cause Egypt was intersected [by canals]. This king [Sesostris] moreover (so 
they said) divided the country among all the Egyptians by giving each an equal square 
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are obviously true in the world of Euclid, Hilbert nonetheless feels the need to establish 
their consistency by constructing a model. His model is an 'analytic' one (in the sense 
of analytic geometry): it is based on the field obtained from the number 1 by closing 
under +, — , x, -j-, and x i— > yl + x 2 . For Hilbert, it seems, such a field is more real than 
Euclidean space, even though our intuition for fields, and especially for taking square 
roots in them, comes from geometry. 

The approach to N in which properties Erg] above are 'equivalent' is not quite natural- 
istic, not quite axiomatic. Considered as axioms in the sense of Hilbert, the properties 
are not meaningfully described as equivalent. But if the properties are to be understood 
just as properties of the numbers that we grew up counting, then it is also meaningless 
to say that the properties are equivalent: they are just properties of those numbers. 

An axiomatic approach to mathematics may lead to a constructive approach, whereby 
we create our objects of study. Hilbert could have ignored his axioms and concentrated on 
his model of them, although this did not happen to be his purpose. Regarding N, we may 
or may not wish ultimately to take a constructive approach, creating a model of the Peano 
Axioms by, for example, the method of von Neumann. If we do take this approach, then 
we presumably have recognized something more fundamental than the natural numbers: 
perhaps sets, which we may have approached naturalistically or axiomatically. But then 
the construction of the natural numbers may add to our understanding of, and confidence 
in, these sets. This is indeed how I see the von Neumann construction, along with the 
generalization in Part below. 

2. Numbers, quasi-axiomatically 

Through Spivak's Calculus [35] , I had my first encounters with serious mathematics. 
Because this book is worthy of scrutiny, I examine here its treatment of foundational 
matters. In the beginning, Spivak takes the real numbers as being out in nature. He 
just calls them 'numbers,' and in his first chapter he identifies twelve of their properties. 
These properties will, in his chapter 27, be recognized as the axioms for ordered fields. 
Meanwhile, in some proofs in his chapter 1, Spivak uses assumptions that he goes on 
to identify as having been unjustified. For example, to prove that, if a product is zero, 
then one of the factors must be zero, Spivak observes that, if a ■ b = 0, and a ^ 0, then 
a -1 • (a • b) = 0. Thus he tacitly assumes that zero times anything is zero; on the next 
page, he comes clean and proves this. I find no fault here. 



parcel of land, and made this his source of revenue, appointing the payment of a yearly 
tax. And any man who was robbed by the river of a part of his land would come to 
Sesostris and declare what had befallen him; then the king would send men to look 
into it and measure the space by which the land was diminished, so that thereafter 
it should pay in proportion to the tax originally imposed. From this, to my way of 
thinking, the Greeks learnt the art of measuring land [yecjuetpit]]; the sunclock and 
the sundial, and the twelve divisions of the day, came to Hellas not from Egypt but 
from Babylonia. \Tf\ 2.109] 

For YEtJuetpiri (the Ionic form of the word) here, other translators, as |i6| . just use 'geometry'. It is too 
simple to say that geometry developed from surveying, since it was not necessary for geometry as we 
know it to develop at all. On the other hand, if our experience of the world came entirely from gazing at 
the heavens, and if we nonetheless developed a geometry, then this would probably be what we in fact 
call spherical geometry. 



6 



DAVID PIERCE 



The difficulties for me arise in Spivak's chapter 2. Here the various 'sorts' of numbers 
are introduced, starting with the natural numbers. The four properties of the natural 
numbers listed above in §1] are stated, and it is said that, from each of the first three 
properties, the others can be proved. For example, Spivak proves by induction that a set 
of natural numbers without a least element must be empty: 1 cannot be in the set, since 
otherwise it would be the least element; and for the same reason, if none of the numbers 
1, . . . , k is in the set, then k + 1 is not in the set. 

The problem is that this argument uses more than induction: it uses that the set 
of natural numbers is ordered so that n^k or k+l^n for all elements k and n. 
The existence of this ordering does not follow by induction alone. It does follow, if one 
has the natural numbers as real numbers. Again, for Spivak, the real numbers are just 
'numbers', so the natural numbers must be among these; but Spivak does not emphasize 
this in the text. In exercise 25 at the end of chapter 2, an inductive set is defined as a 
set of numbers that contains 1 and is closed under addition of 1. The reader is invited 
to show that the set of numbers common to all inductive sets is inductive. It could be, 
but is not, made clear at this point that the properties called [1] and [4] in §1] above are 
equivalent properties of inductive sets. 

Spivak's Calculus was a reference text and a source of exercises for a two-year high- 
school course taught by one Donald J. BrownQ The main text for the course was notes 
that Brown wrote and distributed, either as typed and mimeographed pages, or more 
usually as writing on the blackboard that we students copied down. Brown's notes are in 
some ways more formally rigorous than Spivak. From the beginning, they give the real 
numbers explicitly as composing the complete ordered field called R. And yet the notes 
present the natural numbers as a set {1,2,3, . . . } for which the 'fundamental axiom' is 
the 'Well Ordering Principle'. From this, the 'Principle of Mathematical Induction' is 
obtained as a theorem; it is left as an exercise to obtain the Well Ordering Principle 
as a theorem from the Principle of Induction. Here again, proofs must rely on hidden 
assumptions. At the relevant point in my copy of Brown's text, I find the following 
remark, apparently added by me during the course: 'If the given "definition" of N means 
anything, it must be the equivalent of the Principle of Mathematical Induction. Thus 
the P. of M.I. need not be a theorem, and the W.O.P. need not be an axiom.' 

Part 2. 

3. Numbers, recursively 

Let us forget about R and try to start from scratch. We should distinguish clearly 
between recursion from induction. Recursion is a method of defining sets and functions; 
induction is a method of proving that one set is equal to another^ We can try to 
understand N as having the following recursive definition: 

(1) N contains 1; 

(2) if N contains x, then N contains x s . 

Since this is explicitly a definition, there is really no need for a third condition, though 
it is sometimes given: 



This was at St. Albans School in Washington, D.C., 1981-3. 
7 One text that makes the distinction clear is Enderton \Tol §1.2, pp. 22-30]. 
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(3) N contains nothing but what it is required to contain by (PQ) and (jgj). 
Again, I hold this third condition to be redundant for the same reason that the words 
only if are redundant when, for example, we define a natural number as prime if and 
only if it has just two distinct divisors. 

The recursive definition of N means no more nor less than that, if A is a subset of N 
that contains 1 and that contains the successors of all of its elements, then A = N. In 
short, the definition means that N admits proofs by induction. This can also be 
expressed by writing 

N = {l,l s ,l ss ,...}. 

The recursive definition of N does not bring N into existence. Rather, it picks out N 
from among some things that are assumed to exist already. The definition assumes that 
there is some set that contains 1 and is closed under s. Then N is the smallest such set, 
namely the intersection of the collection of such sets. 

Definitions such as the one just given have been found objectionable for being 'im- 
predicative.' Zermelo |48| p. 191, n. 8] reports that Poincare had such an objection to 
Zermelo's proof of the Schroeder-Bernstein Theorem. Let us have a look at this proof. 

Suppose A, B, and C are sets such that A C B C C, and / is a bijection from C 
onto A. We aim to find a bijection g from B onto A. If there is any hope of finding 
such g, then probably we should have 



f(x), ifxeD, 

x, if x € B \ D, 



(i) 



for some subset D of B. In this case, since the range of g is to be (a subset of) A, we 
must have in particular B \ D C A and so B \ A C D. Injectivity of g will follow from 
having f[D] C D. Let D now be the smallest X such that 

(B^A)Uf[X]QX. 

That is, let D be the intersection of the set of all such sets X. We obtain g as desired. 
Indeed, when D is such, then (B \ A) U f[D] = D. Since also F[D] C A, we have that 
B n A and f[D] are disjoint. Therefore B is the disjoint union of B \ A, f[D], and 
B \ D, as in the figure below. We conclude (B \ D) U f[D] = A, so g is indeed surjective 
onto A. 




B\A 



f[D] 



B\D 




Such is the proof in the article [57J p. 208] of Zermelo giving his axioms of set-theory, 
although Zermelo does not give the argument in the heuristic)! style that I have used. 



The historically correct word is analytic, in view of remarks like this of Pappus: 

Now analysis [dvaXoaic] is a method of taking that which is sought as though it were 
admitted and passing from it through its consequences in order to something which is 
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The argument might be said to assume the existence of what it purports to establish the 
existence of, since D belongs to the set of which it is the intersection. 

For an alternative heuristic argument, again suppose g is as in jl]), so that 

g[B]=f[D]U(B^D). (ii) 

Say g[B] = A. Then in particular g[B] C A, so by {[TTJ) we have B\DC4, hence 

B\AC D. (iii) 

Also, A C g[B], but by flu]) we have g[B) C f[B] U (B \ L>), hence 

A \ CB\D. (iv) 

Since (B \ yl) U (A \ /[B]) = B \ f[B], we now have 

g[B^f[B]]=g[(B^A)U(A^f[B])] 

= g[B^.A]Ug[A^f[B]} 

= f[B \ A] U (4 \ /[-S]) [by dm]) and 

= (/[B]n/[A])U(A\/[S]) = A\/[A]. 

Assuming g is injective, we conclude ^[/[i?]] = f[A\. Now we can proceed as before, but 
with f[B] and f[A] in place of B and A. In particular, we replace ([11]) with 

9 [/[B]] = /[On/[B]]U(/[B]\D) 

and obtain f[B] \ f[A] CO, and so forth. We find ultimately 

[j{B x A, f[B] x f[A],f[f[B]] x /[/[A]], ...}CD. 

Conversely, to ensure that g[B] = A, it suffices to let D be this union. Indeed, this is the 
same D found in Zermelo's argument above, only now it is found through a recursive 
procedure, as \J{B n \ A n : n G N}, where 

B\ = B, Ai = A, B s ( n ) = f[B n ], A s ( n ) = f[A n }. 

Is this better than finding D as the intersection of a set that contains Dl Well, it assumes 
the existence of N, whose recursive definition was accused of being impredicative. What 
is worse, the existence of the function n 1— > (A n ,B n ) is not justified merely by the 
possibility of defining N itself recursively as we did above. For this reason, I prefer 
Zermelo's argument. 

I propose to refer to any selH that has a dist ing uished element and a distinguished 
singulary operation as an iterative structureo The distinguished element can be 
called 1, and the operation, s; or they can be called l 21 and s a , to avoid ambiguity, if the 
structure itself is 21, but there are other iterative structures ar oundJB Then an iterative 



admitted as a result of synthesis [auvwecnc] ; for in analysis we suppose that which is 
sought to be already done, and we inquire what it is from which this comes about. . . and 
such a method we call analysis, as being a reverse solution [AvdnaXiv Xuolc]. |g7J) 
p. 601] 

The style of geometry pioneered by Descartes is not so much analytic as algebraic. 
9 Later the possibility of using a proper class here will be considered. 
10 Stoll I36I §2.1, p. 58] uses the term 'unary system'. 

"This is like calling me David, unless there is another David present, in which case I may become 
David Pierce. 
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structure is just an algebra in the signature {l,s}. An iterative structure meets minimal 
requirements for repeated activity: it gives us something — s — to do, and something — 1 — 
to do it to. 

Every iterative structure has a unique smallest substructure, which is the intersection 
of the collection of all substructures £j this intersection therefore admits induction and 
can be recursively defined. In particular, N is such an intersection; but N has properties 
beyond admitting induction. In particular, if 21 is another iterative structure, then there 
is a unique homomorphism h from N to 21, given by the recursive definition 

(1) h(l) = 1, that is, h(l N ) = l % ; 

(2) h{x s ) = h(x) s , that is, h(s N (x)) = s a (/i(x)). 
In short, 

^ = {(i,i),(iM s ),(i s M ss ),...}, ( V ) 

where the left-hand entries in the ordered pairs are from N; the right-hand, from A. 

We have now merely asserted the possibility of defining functions recursively on N. 
Arithmetic follows from this assertion: the binary operations of addition, multiplication, 
and exponentiation can be defined recursively in their right-hand arguments. First we 
have 

m + 1 = m s , m + x s = (m + x) s , (vi) 

that is, the function x 1— > m + x is the unique homomorphism from (N, 1, s) to (N, m s , s). 
Then s is x 1— > x + 1, and we continue with the definitions 

m ■ 1 = m, m ■ (x + 1) = m ■ x + m; ( vn ) 

m 1 = m, m x+1 = m x ■ m; ( viii) 

that is, the functions x 1— > m-x and x 1— > m x are the unique homomorphisms from (N, 1, s) 
to (N, m, x 1— > x + m) and (N, m, x 1— > x ■ m) respectively. 

Skolem [32] develops arithmetic by recursion and induction alone, though without 
discussing the logical relations between these two methods and without mentioning ho- 
momorphisms as such. Skolem's purpose is to avoid logical quantifiers. Instead of defining 
m < n to mean m + x = n for some x, he defines it by saying that m < 1 never, and 
m < k + 1 if m < k or m = k. To see this definition in terms of homomorphisms, let us 
denote the set {0, 1} by IB for BooleJ^l Every binary relation R on N corresponds to a 
binary characteristic function \R from N into B given by 

XR(x,y) = 1 <^ x Ry. 

We are defining the function x 1— > x<{ m i x ) from N to B, which is described by the 
table below. Unless m = 1, the function is not a homomorphism from (N, l,s) into 
(B, 0, t) for any choice of t, simply because there is no operation t on B that takes each 
entry in the bottom row of the table to the next entry. However, there is an operation 
on N x B that takes each column of the table to the next column, namely (x, y ) 1— > 

12 This assumes that the iterative structure is based on a set, rather than a proper class. 
13 From Boole himself [TJ p. 41]: 'We have seen that the symbols of Logic are subject to the special 
law, 

2 

x = x. 

Now of the symbols of Number there are but two, viz. o and 1, which are subject to the same formal 
law.' 
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1 


2 




m — 1 


m 


m + 1 


m + 2 




X<{m,x) 
















1 


1 





^x + 1, max(y, x=(m, x)) Y Call this operation t; then (assuming m ^ 1) the function 

x i—T- (x, X<(' 71 ) x )) is the unique homomorphism from (N, 1, s) to (N x B, (1, 0), i) . 

If / is an arbitrary function from N to an arbitrary set A, then the function x i— > 

(x, /(x)) is a homomorphism from (N, 1, s) to ^Nxi, (l, /(l)) , (x, y) i— > (x+1, /(x+1)) V 
But to define f this way would be not only impredicative, but circular. The operations 
on N that Godel [Tjj calls recursive are not defined as homomorphisms. Rather, the 
collection of recursive operations can itself be defined recursively as follows (and here 
I follow Godel in letting the initial element of N be 0): 

(1) constant operations and s are recursive; 

(2) compositions of recursive operations are recursive; 

(3) if ip is an (n — l)-ary, and /ian (n + l)-ary, recursive operation for some positive 
n, then the ra-ary operation <p is recursive, where, for each n-tuple a of natu- 
ral numbers, the function x 1— > (x, <p(x, a)) is the unique homomorphism from 

(N, 0, s) to (n x N, (0, V(a)) , (y, z) H- (y + l,n(y, z, a))) . 

Thus the recursive operations compose a sort of algebra which is closed both under 
composition and under the operation that produces (p from ip and \i. 

4. Numbers, axiomatically 

Why does N admit recursive definitions of functions? The function h in (jvj) can be 
understood as the smallest subset of N x A that contains (1,1) and is closed under the 
operation (x,y) 1— > (x s ,y s ). This set is a relation R from N to A, and it follows by 
induction that, for each x in N, there is at least one y in A such that x R y. If this 
y is always unique, then R is the desired homomorphism h. However, uniqueness of y 
does not follow by induction alone: it is necessary, as well as sufficient, that 1 be not a 
successor in N and that immediate predecessors, when they do exist, be unique. This is 
Theorem [3] below. 

Dedekind [jj II (130), pp. 88-90] observes the sufficiency of the additional conditions, 
along with the necessity of some additional condition. Indeed, let us say that an arbitrary 
iterative structure 21 admits recursion if, for every iterative structure 03, there is a 
unique homomorphism from 21 to 23. In particular, N admits recursion. If also 21 admits 
recursion, then the homomorphism from 21 to N must be the inverse of the homomorphism 
from N to 21. Indeed, the composition (in either sense) of the two homomorphisms is a 
homomorphism, but so is the identity, hence these must be equal. Therefore 21 and N 
are isomorphic, and 21 admits induction, simply because N does. However, as Dedekind 
notes, the converse does not follow; 21 may admit induction, but not recursion. Indeed, 
for any natural number n, the quotient Z/(n) can be considered as an iterative structure 
whose initial element is and in which the successor of x is x + 1. Then Z/(n) admits 
induction; but there is no homomorphism from Z/(n) to Z/(m) unless m \ n. 

As Landau [22, Thins 4 & 28; see also pp. ix. f.] shows implicitly, the recursive def- 
initions of addition and multiplication in (fvij) and (|viip are in fact valid on arbitrary 
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iterative structures that admit induction. Such structures may be finite, as in the figure 
below (where n may be 1, and m may be 0). Henkin [15] makes the point explicit, while 



12 3 n n+1 




observing that the recursive definition of exponentiation fails in some such structures. 
Indeed, since for Henkin the initial element of N is 0, his definition of exponentiation is 
not (jviiip . but 

x° = s(0), x 5 ^ =x u -x, 

which always fails in Z/(n) in case n > 1, simply because the definition requires both 
0° = 1 and n = 0. However, if we keep 1 as the initial element of N, then the following 
curiosity arises. 

Theorem 1. On Z/(n), the recursive definition (jviiip of exponentiation is valid if and 
only ifne {1,2,6,42,1806}. 

Proof. The desired n are just such that 

x n+1 = x (mod n). 

Such n are found by Zagier [4B], and earlier by Dyer-Bennet [9], as I learned by entering 
the sequence 1, 2, 6, 42, 1806 at taj. Alternatively, such n must be squarefree, since if p 
is prime and p 2 | n, then (n/p) 2 = (mod n), although n/p ^ (mod n). So we want 
to find the squarefree n whose prime factors p are such that x n+l = x (mod p), that is, 
either p \ x or x n = 1 (mod p). Since x here might be a primitive root of p, we just want 
n such that p— 1 | n whenever p is a prime factor of n. Let us refer to a prime p as good 
if p — 1 is squarefree and all prime factors of p — 1 are good. Then all prime factors of 
n must be good. This definition of good primes can be understood as being recursive. 
Indeed, the empty set consists of good primes. If A is a finite set of good primes, then 
every prime of the form 

i+n? 

pes 

where B C A, is good. Then let A' comprise these primes, along with the primes in A. 
We have a recursively defined function x 1— > A x on N, where A\ = and Ax+i = A x ' . 
The set of good primes is the union of the sets A x . We compute 

yl 2 = {2}, A 3 = {2,3}, A A = {2,3,7}, A 5 = {2, 3, 7, 43}, 

but then the sequence stops growing, since 

2 • 43 + 1 = 87 = 3 • 29; 2 • 7 • 43 + 1 = 603 = 3 2 • 67; 

2 • 3 • 43 + 1 = 259 = 7 • 37; 2 • 3 • 7 • 43 + 1 = 1807 = 13 • 139. 
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So the set of good primes is {2, 3, 7, 43}. In this set, we have 

p < q p | q — 1. 

Hence the set of desired n is {1, 2, 2-3, 2-3-7, 2 • 3 • 7 • 43}, which is as claimed. □ 

The sequence (2, 3, 7, 43) of good primes is the beginning of the sequence (ai, a2, • • • ), 
where a n +i = a% ■ a-i ■ ■ ■ a n + 1. This sequence arises from what Mazur [26J calls the 
self-proving formulation of Proposition IX. 20 of Euclid's Elements. Euclid's formulation, 
in Heath's translation, is, 'Prime numbers are more than any assigned multitude of 
prime numbers. In these terms, the self-proving formulation is that, if a multitude of 
primes be assigned, then the product of its members, plus one, is a number whose prime 
factors — of which there is at least one — are not in the assigned multitude. 

Again, as an iterative structure, N admits recursive definition of functions because 
of the theorem below. The theorem should be standard. Dedekind [71 II (126), p. 85] 
proves the reverse implication, as does, more recently, Stoll I36I ch. 2, Thm 1.2, p. 61]. 
Mac Lane and Birkhoff [2"g| mention the theorem, apparently attributing the forward 
direction to Lawvere: they refer to admission of recursion by an iterative structure as 
the Peano-Lawvere Axiom. (Lawvere and Rosebrugh [2"g] call it the Dedekind-Peano 
Axiom.) 

Theorem 2 (Recursion). For an iterative structure to admit recursion, it is sufficient 
and necessary that 

(1) it admit induction, 

(2) 1 be not a successor, and 

(3) every successor have a unique immediate predecessor. 

An iterative structure meeting the three conditions listed in the theorem is what 
Dedekind fjj II (71), p. 67] calls a simply infinite system; it is what is axiomatized by 
the so-called Peano Axioms [28] . Peano seems to assume that the Recursion Theorem is 
obvious, or at least that recursive definitions are obviously justified by induction alone. 
They are not, as Landau points out \s^\ pp. ix-x]; but the confusion continues to be 
made £1 

To establish the sufficiency of the conditions given in the theorem, we can use them 
as suggested at the head of this section. Namely, we can show that the unique homomor- 
phism, given in (jvj), from N to an arbitrary iterative structure 21 is the intersection h of the 
set of all relations from N to A that contain (1, 1) and are closed under (x,y) h-> (x s ,y s ). 
This requires assuming that N itself is indeed a set, as opposed to a proper class. With- 
out this assumption, one can obtain h as the union of the set comprising every relation 
R from N to A such that, if x R y, then either (x,y) = (1, 1) or else (x,y) = (u s ,v s ) for 
some (u, v) such that u Rv. 



14 Ol jiptJTOi apidiJLOi 7iXeiouc ekrl Tiavxoc tou tipotstSevtoc Kki/douc, 7tpwxwv otpiiL^ov [6]. Euclid says 
not that there are infinitely many prime numbers, but that there are more than can be told. 

15 Indeed, Mac Lane and Birkhoff [25! seem to invite the student to make the confusion. They give 
the Peano Axioms and then immediately use recursion to define the iterates of a permutation of a set. 
Only later is the equivalence of recursion to the axioms mentioned. Another writer who invites confusion 
is Burris [3] Appendix B, p. 391]: he states the Peano Axioms and then immediately defines addition 
recursively without justification, although he claims to be following the presentation of Dedekind. 
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Parallel to the two ways of obtaining h as in (jvj), there are two ways to obtain N itself; 
these are considered in the next section. 

5. Numbers, constructively 

The next two sections have aspects of an historical review of set-theory. However, I re- 
fer only to articles and books that I have direct access to (mainly, though not exclusively, 
in van Heijenoort's anthology [40]); therefore my review is incomplete, as the references 
in Fraenkel et al. [13] and in Levy [24] make me aware. 

In a letter to a skeptic called Keferstein, Dedekind [8j explains his work as an answer 
to the question: 

What are the mutually independent fundamental properties of the sequence N, 
that is, those properties that are not derivable from one another, but from which 
all others follow 4_j 

Having discovered these properties and made them the defining properties of a simply 
infinite system, Dedekind feels the need to ask, 

does such a system exist at all in the realm of our ideas? Without a logical 
proof of existence it would always remain doubtful whether the notion of such 
a system might not perhaps contain internal contradictions. 

Dedekind finds his proof in 'the totality S of all things, which can be objects of my 
thought' [7J II (66), p. 64]: this is closed under the injective but non-surjective operation 
that converts a thought into the thought that it is a thought. But I will say with 
Bob Dylan, 'You don't need a weatherman to know which way the wind blows.' The 
consistency of Dedekind's definition of a simply infinite system is self-evident. If we do 
construct an example, this will tend to confirm, if anything, the validity of our method 
of construction, rather than the consistency of the definition itself. 

The existence of a simply infinite system with an underlying set is the Axiom of 
Infinity of set-theory. I take set-theory to begin with the Extension Axiom mentioned in 
§DJ along with the 'Comprehension Axiom': every property determines a set, namely 
the set of objects with the property. Russell shows this 'axiom' to be false in his letter 
to Frege: 

Let w be the predicate: to be a predicate that cannot be predicated of itself. Can 
w be predicated of itself? From each answer its opposite follows. Therefore we 
must conclude that w is not a predicate. Likewise there is no class (as a totality) 
of those classes which, each taken as a totality, do not belong to themselves. 
From this I conclude that under certain conditions a definable collection does 
not form a totality. [gr] 

It is common now to refer to Russell's totalities^ as sets, and to his definable collections 
as classes. Then all sets are classes. The 'Comprehension Axiom' is the converse; but it 
is false. If alternative axioms for set-theory can be proposed, and these allow construction 
of a satisfactory example of a simply infinite system, then the construction will not really 
justify the Peano Axioms; rather, it will tend to justify the set-theory axioms. 
Zermelo [47] offers seven axiom for sets; they can be expressed as follows: 

l6 I take the liberty of writing N where van Heijenoort's text has simply N. 
17 Actually the term 'totality' in the quotation is a translation from Russell's German. 
l8 Like Zermelo, I shall not bother to distinguish axioms from axiom schemes. 
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1. Extension (as in §nj). 

2. Elementary Sets: classes 0, {x}, and {x,y}, with no, one, and two elements, 
are sets. 

3. Separation: subclasses of sets are sets. 

4. Power Set: the class &(x) of subsets of a set x is a set. 

5. Union: the union (J x of a set x is a set. 

6. Choice: the union of a set of nonempty disjoint sets has a subset that has exactly 
one element in common with each of those disjoint sets. 

7. Infinity: there is a set that contains and closed under the operation x h-> {x} 
of singleton formation. 

By the Axioms of Union and Elementary Sets, two sets x and y have a union, x L) y, 
namely Ul^'^}- By the Power Set Axiom, for every set x, there is a set, namely &(x), 
which has greater cardinality by Cantor's Theorem^ In particular, the sequence 

x, &>{x), ^(^(x)), &>(3»{&>{x))), ... (ix) 

is of strictly increasing cardinality, although we have not formulated a precise notion of 
sequence. 

By the Separation Axiom, the intersection f] C of a nonempty class C is a set. Hence, 
by the Axiom of Infinity, the intersection of the class of sets that contain and are 
closed under x 1— > {x} is a set Q. Therefore (0,0, x 1— > {x}) is a simply infinite system. 
In particular, it admits induction, and we may express this by writing 

O = {0,{0},{{0}},...}. (x) 

It would however be more satisfactory to obtain Q, as a class, without assuming the 
Axiom of Infinity. To this end, let us denote by 

Zm 

the class comprising every set whose every element is either or {x} for some x in the 
set. Then Zm contains the sets 0, {0}, |0,{0}}, |0, {0}, {{0}}|, and so forth. The 

hope is that |J Zm is the desired class Q. 

To tell whether the hope is realized, we can note first that induction makes sense 
for proper classes. In particular, suppose |J Zm has a subclass D that contains and 
is closed under x 1— > {x}. Then on the class |J Zm \ D, the function {x} 1— > x of 
unique-element extraction is well-defined, and the class is closed under this function. 
If we could conclude that (J Zm \ D must be empty — which we cannot yet do — then 
(U Zm, , x 1 — y {x}) would admit induction. 

Skolem [3*31 §4, pp. 296-7] observes that Zermelo's axioms do not guarantee the exis- 
tence of a set comprising the terms of the sequence in (jlxl) . He remedies this with the 
following axiom (also discovered by Fraenkel): 

8. Replacement: the image of a set under a function is a set. 

Skolem makes the notion of a class precise: it is defined by a formula in (what we call) 
the first-order logic of the signature {£}. These formulas can be defined recursively: 

1. Equations x = y and 'memberships' x £ y are formulas. 



19 The theorem is so called by Zermelo [47] p. 211]. 
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2. if x is a variable, and (p and ip are formulas, then -199, (99 =>• ■0), and 3x (p are 
formulas. 

More precisely, a formula in one free variable defines a class, while a formula in too free 
variables defines a binary relation, which may be a function. A binary relation becomes 
identified with a class when, with Kuratowski [21) p. 171] Q we define ordered pairs by 
the identity 

{x,y) = {{x},{x,y}}. (xi) 

If we are willing to accept a recursive definition like the definition of formulas in the 
signature {g}, then we should accept a definition of natural numbers as strings of the 
form s - • • si (or perhaps instead of the form 1 • • • 1). However, such definitions do not 
give us natural numbers or formulas as composing sets. Nor is this required for the sake 
of formulating set-theory in the first place. 

A class on which the function {x} 1— > x is a well-defined operation is one example of 
a class whose every element has nonempty intersection with the class. Skolem h^Jl §6, 
p. 298] perceives the possibility, along with the non-necessity, of such classes: this suggests 
non-categoricity of the axioms so far. Von Neumann \%T\ §5, pp-413-4] more explicitly 
proposes excluding such classes with a new axiom. His axiom can be understood as that 
there is no sequence (a, a', a", . . . ) such that a' G a, and a" £ a', and so forth. Since 
we are now engaged in determining how such a sequence can be formulated in the first 
place, let us introduce the new axiom in the following form. 

9. Foundation: membership is well-founded on every nonempty class, that is, some 
element is disjoint from the class 

With the new axiom then, the definition of f2 in (jxj) as the union |J Zm works, in that 
U Zm admits induction by the argument above. However, it is not very satisfying to 
have to rely on the new axiom. We can avoid using the axiom as such by incorporating 
it into the definition of Zm. 

Theorem 3. Let Zm' comprise those elements of Zm whose every subset x is disjoint 
from some element of x. Then ((jZm', 0,x 1— > {x}) is a simply infinite system, even 
without recourse to the Axioms of Infinity and Foundation. 



20 An earlier definition of ordered pairs is Wiener's [45]: (x,y) = |{{a;}, 0}, {{j/}} ji though Wiener 

calls it l'(l'l'x U t'A) U i l i l i l y, using the notation of Whitehead and Russell [43) pp. 28-37]. 

21 The Foundation Axiom has a set form, namely that that membership is well-founded on every 
nonempty set. See Levy [24") II. 7, p. 72] on the equivalence of the two forms in the presence of the other 
axioms. However, in the absence of the Axiom of Infinity, the set form of Foundation does not imply 
the class form given above. A proof that this is so can be adapted from a proof given by Cohen [51 II. 5, 
p. 72] to show the independence of the set form of Foundation from the other axioms. Suppose we do 
have u> in the usual sense, as at (|xiv[) below. Let C be a set {a x : x £ w} indexed by w such that no 
member is an element of another member. For example, each a n could be {n + 1}. If a is an arbitrary 
set, let 3» f (x) denote the set of finite subsets of a. Now let C* = \J{C, 9?t{C), &>t(@>f(C)), ...}. Define 
< on C* so that a < b if and only if either a G b, or else a — at+i and b = at for some k in cu. Then 
(C*, <) is a model of the first eight numbered axioms above, except Infinity; and it is a model of the set 
form of Foundation. But it is not a model of the class form of Foundation. Indeed, in (C*, <), we can 
define the class D of all sets A such that, if 6 € A, then b C A. Then D = \J{0, 3*(0), f?{f?{0)), . . . }, 
and C* \ D is a counterexample to Foundation. 
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Proof. If A G Zm', and D is a subclass of (J Zm' that contains and is closed under x \— > 
{x}, then on ^4n((J Zm'\D), the operation {x} \— > x is well-defined, so 74n(|J Zm'\D) 
must be empty. Since A was arbitrary, (J Zm' \ _D is empty. □ 

It may appear that the definition of as |J Zm' still does not yield what is desired. 
Foundation excludes sets on which the operation {x} i— l x is well-defined; but suppose A 
is a nonempty 'collection' on which {x} i— )• x is well-defined. Then the union j4 U may 
belong to Zm', unless A can be obtained as a subclass of this union. If we can obtain f2 
by other means, then we can obtain A as (iUfi)\ f2; but this just begs the question of 
whether we have as a class in the first place. 

If we think we need to fix this problem, we might try letting Zm" comprise the finite 
elements of Zm'. Here we can use for example the definition mentioned by Whitehead 
and Russell |gj *120.23]@ whereby B is finite if every subset of &(B) contains B, 
provided the subset contains and is closed under the operation x t— >■ {c} U x for each c 
in B. If Zm" had a finite element AU Qo, where A violated Foundation, then we could 
remove the elements of S7o one by one, thus obtaining A, in violation of Foundation. 
However, we still have no guarantee that we can do this, unless we already have f^o as a 
class. 

The problem here is the one that Skolem hit upon: set-theory is not categorical. 
Indeed, suppose the formula <p(x) defines the class Q as in (jxj), and introduce constants 
c, c', c", ... Then the sentences 

(p(c), c G c, tp(c'), c"£c', p(c"), 

are consistent with set-theory, in that no contradiction is derivable from them that is not 
already derivable from the axioms. We must settle for observing that, in any model of 
set theory, ip defines a class that acts as if it were as in (jxj). From outside the model, 
we may see an infinite descending chain of elements of the model; but the chain is not a 
class of the modeljfl 

6. Ordinals 

As sets may be gathered into classes, so classes may be gathered into — let us say 
families. For example, given an equivalence-relation on a class, we have the family of 
equivalence-classes of members of that class. 

Given well-ordered sets 21 and 03, let us say 21 < 03 if 21 is isomorphic to a proper 
initial segment of 03. Then exactly one of x < y, x = y, and y < x holds for all well- 
ordered sets x and y. Hence the relation < induces a well-ordering of the family of 
isomorphism-classes of well-ordered sets. 

It may be desirable to find a well-ordered class that is isomorphic to the family of 
isomorphism-classes of well-ordered sets. This will be 

ON, 



22 I learn the reference from Suppes [37] p. 102]; various definitions of finite sets are discussed by 
Tarski [38]. 

23 Edward Nelson recommends conceiving some natural numbers as nonstandard and as compos- 
ing an infinite descending chain. See Chapter 1, 'Internal set theory,' of a once- proposed book 
: //www. math. princeton. edu/~nelson/books .html , accessed March 30, 2009). 
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the class of ordinal numbers. Then every well-ordered set will be isomorphic both 
to a unique ordinal and to the well-ordered set of its predecessors in ON. With this 
motivation and observation, von Neumann obtains ON so that each element is the 
set of its predecessors. 

Suppose 21 is an order, or ordered set, (A, <): by this I mean simply that < is 
irreflexive and transitive on the set A@ A section of 21 is the set of predecessors in A 
of a particular element of A. Let the set of predecessors of x be denoted by pred(x), so 
that 

pred(x) = {y G A: y < x}. 
Then 21 admits induction if the only subset B of A for which 

pred(x) C B => x € B 

for all x in A is A itself. There are various ways to define recursion in ordered sets; it is 
simplest for present purposes to say that 21 admits recursion if, for every class C, for 
every function F from <^(C) to C, there is a unique function g from A to C given by 

g(x) = F({g(y) : y < x}) = F(g[pred(x)]). (xii) 

As with Theorem [2 the following should be standard. Indeed, possibly awareness of 
the following has led to the misconception that recursion and induction are equivalent 
properties of iterative structures. 

Theorem 4 (Recursion). For total orders, admission of recursion and induction are 
equivalent to each other and to being well-ordered. □ 

If 21 is well-ordered, and in (jxiip we let F be the identity on the universe V, then 

g( x ) = {9{y) ■■ y < x} = g[pved(x)]. (xiii) 

In this case, (<7[t4],g) is the ordinal number isomorphic to 21; indeed, this is so by von 
Neumann's original definition of the ordinal numbers p. 348]. Alternative, equivalent 
definitions include R. Robinson's [30], namely that an ordinal is a set that both is to- 
tally ordered by membership and includes all of its elements. This definition of ordinals 
assumes the Foundation Axiom, which ensures that each ordinal is weZZ-ordered by mem- 
bership. A set that includes all of its elements is a transitive set; then another definition 
of ordinals, again assuming Foundation, is that they are transitive sets of transitive sets. 

The class ON has the injective operation x 1— > x U {x} of succession as well as the 
element 0, which is not a successor. Thus ON meets two of the three conditions for being 
a simply infinite system. It fails to meet the third condition, of admitting induction, if 
there are ordinals other than that are not successors. Such ordinals are limits. The 
class of ordinals that neither are limits nor contain limits is called 

cu. ( xrv ) 

This is an initial segment of ON, so it is either ON itself or a member of it. In any case, 
(w,0,x 1 — y x U {x}) is a simply infinite system. We could let the Axiom of Infinity be 
that cu is a set. 

Fraenkel et al. ch. II, §3, 6, p. 49] are vague on whether this Axiom can be expressed 
in the form 'Such-and-such a class is a set.' They list three forms of the Axiom, called Via, 



2 4So < is what is sometimes called a strict partial ordering, although it might be total. 
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b, and c, namely: there are sets containing and closed under x \— > {x}, i4iU {x}, 

and (x, y) i— >• x U {y} respectively. They write: 

In our heuristic classification of the axioms, as to whether they are instances of 
the axiom schema of comprehension or not, the axiom of infinity can, to some 
extent, be viewed as such an instance. Each of Via and VIb can be stated 
as 'there exists a set which consists of all natural numbers', for the respective 
notion of natural number, and Vic can be stated similarly. 

The authors seem to be treating 'being a natural number' as a condition defining a class 
in the original imprecise sense. However, the authors do have the precise notion of a 
condition as being given by a formula in the first-order logic of G. I do not know why 
they do not directly address the question of whether, without the Axiom of Infinity, there 
is still a formula defining the natural numbers. 

The Recursion Theorem for iterative structures (Theorem [3] above) can be extended to 
include the following, which gives conditions under which the equivalence of properties [T] 
and [3] in §T]can be established. 

Theorem 5. An iterative structure is a simply infinite system if and only if it can be 
ordered so that x < x s and one of the following holds: 

(a) it is well-ordered, and every element besides 1 is x s for some element x, or 

(b) it admits induction. 

Proof. Every simply infinite system is isomorphic to tu, which is ordered as indicated so 
as to satisfy (jsj). 

Suppose 25 is ordered as indicated so as to satisfy (jaj), and 21 is a substructure. If 
B \ A were non-empty, then its least element would be c s for some c; but c < c s , so 
c G A, hence c s G A. Therefore B \ A is empty. Thus 53 satisfies ([b]). 

Suppose finally 21 is ordered as indicated and satisfies ([b]). As noted in ^3} addition can 
be defined in 21 to satisfy (jvT|) . Again by induction, the ordering of 21 satisfies x < x + y. 
Let h be the homomorphism from (N, l,s) to 21. Then h is surjective by induction in 
21, and h(x + y) = h(x) + h(y) by induction on y in N. Moreover, in N, if x < y, then 
x + z = y for some z, so h(x) + h(z) = h(y) and therefore h(x) < h(y). Thus h is 
injective, so h is an isomorphism, and therefore 21, like N, is a simply infinite system. □ 

Ordinals suggest the following alternative form of Theorem that is, another way to 
obtain the class f2 in (jxj). 

Theorem 6. Let C comprise every set 

(1) whose every element is or {x} for some element x and 

(2) which has a well-ordering < such that x < {x} for each element x such that {x} 
is also an element. 

Then (|J C, 0, x *— > {x}) is a simply infinite system, even without recourse to the Axioms 
of Infinity and Foundation. 

Proof. Suppose A G C. Then {x} is the successor of x in the well-ordering of A; that is, 
there is no y such that x < y < {x}. Indeed, if there is, let x be least such that there 
is; and then let y be least. But y 7^ 0, since must be the least element of A. Hence 
y = {z} for some z, and then z < x by minimality of y, so z < x < {z}, contrary to the 
minimality of x. 
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Now suppose also B G C. Then one of A and B is an initial segment of the other, 
when both are considered as well-ordered sets. Indeed, one of them, say A, is isomorphic 
to an initial segment of the other, B. If the isomorphism is h, then there can be no 
least element x of A such that h{x) 7^ x, since if there is, then x = {y} for some y, so 
h{x) = h({y}) = {h(y)} = {y} = x (since {y} succeeds y, and {h(y)} succeeds h(y), in 
the respective orderings). 

It now follows that |J C satisfies condition [a] in Theorem [5] □ 

Having ON, we can define the function R on ON recursively bjjfl 

R(a) = |J{^(R(/3)): f3 < a}. 

Then |J R[ON] is called WF, the class of well-founded sets; it is a model of the set- 
theoretic axioms, including Foundation, regardless of whether the latter 'really' holds; so 
we might as well assume that it does hold. Then WF is the class of all sets. 

Similarly, there is a function L on ON such that |J L[ON] — the class of constructible 
sets — is a model of all of the set-theoretic axioms, including Choice, regardless of whether 
the latter 'really' holds; so we may assume that it does hold0 

Now all of the set-theoretic axioms are consistent with those that can be expressed in 
the form 'Such-and-such a class is a set', along with Extension. 

Part 3. 

7. Algebras and orderings 

Having tu, we can define arbitrary structures. This is done at the beginning of any 
model-theory book (such as Hodges [19] ), except that these books generally assume that 
structures and their signatures are based on That assumption is not made here. 

If C is a class, and n £ w, then C n can be understood as the class of functions from 
ra into C. (Such functions are indeed sets, by the Replacement Axiom, so there can be 
a class of them.) An ra-ary relation on C is a subclass of C n . In particular, a miliary 
(0-ary) relation is either or {0}, that is, or 1 (the sets composing B as used in ^j). 
An ra-ary operation on C is a function from C n into C. In the most precise sense, a 
signature can be defined as two disjoint classes — of predicates and function-symbols 
respectively — together with a function from their union into tu. A signature (o5^ p , , ar) 
may be written more simply as J^, and 5? may be treated as U ; then a structure 
with this signature is a class C together with a function s 1— > s £ on 5? such that if 
ar(s) = n, then s ff is an ra-ary relation (when s E ^ p ) or operation (when s G JT'f) on 
C. The class C is then the universe of the structure, and s c is the interpretation of 
s in the structure; but as noted in we ma y refer to the interpretation as s also. I 
follow the tradition of denoting a structure by the Fraktur form of the letter used for its 
universe. 



25 I use the notation of Kunen |2D1 p. 95]. It seems the original definition is by von Neumann. 
2< ^For the same reason, we may assume that the Generalized Continuum Hypothesis holds; but we 
don't. 

27 Ziegler f^j treats the 'monster model' of a complete theory as based on a proper class; but the 
signature is still a set. 
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An algebra is a structure whose signature has only function-symbols. If 21 and 03 are 
algebras with the same signature, then a homomorphism from the former to the latter is 
a function h from A to B, inducing the function h from A n to B n for each n in tu, such 
that 

h(F*(x)) = F <s (h{x)) (xv) 

for all x in A n , for all n-ary F from the signature, for all n in cu. In case n = 0, (jxvj) 
becomes h{F^) = F^. An algebra admits recursion if from it to every algebra in the 
same signature there is a unique homomorphism; an algebra admits induction if it has 
no proper sub-algebra. I do not know who first formulated the following generalization 
of Theorem [2 though Enderton \TU\ p. 27] alludes to such a theorem: 

Theorem 7 (Recursion). An algebra admits recursion if and only if it admits induction 
and its distinguished operations are injective and have disjoint ranges. □ 

In a signature with no miliary function-symbols, the only algebras that admit induction 
are empty. An arbitrary algebra that admits recursion can be called free. All free 
algebras in the same signature are isomorphic. 

If the universe of an algebra OS is a set, then 03 has a subalgebra 21 that admits induc- 
tion. Indeed, the universe of 21 is the intersection of the set of universes of subalgebras 
of 03. In short, 21 is the smallest subalgebra of 03. If the universe of 53 is a proper class, 
then we cannot obtain 21 in this way. We might use a trick to obtain 21, as in Theorem 
or [6J Alternatively, if we can obtain a free algebra $ in the same signature, then 21 will 
be the image of 5 in 03. 

In an arbitrary algebraic signature J?, we can obtain a free algebra as a subalgebra of 
the algebra <5(<5^) of strings of y. The universe of & is S? <a> , namely U{^ n : n ^ w }l 
but in this context an element k 1— > of Sf n is written as sq ■ ■ ■ s n -i. The length of 
every element of 5^ n is n. If F is an n-ary symbol of =5^, then F 6 is the operation 

(to, • • • ,t n -i) 1 — ► Ft ■ ■ ■ t n -x (xvi) 

of concatenation on 6. Here the length of Fto ■ ■ ■ t n -\ is one plus the sum of the lengths 
of the tk- Because the class w of possible lengths is well-ordered, we have the following. 
The theorem is a requirement for doing symbolic logic, but perhaps not every logician 
bothers to work out the proof. 

Theorem 8. &( r y) has a subalgebra that is a free algebra. 

Proof. Let C be the class of subsets b of y <a> such that every element of b is Fto • • • t n —\ 
for some elements i/% of b, for some n-ary F in S fi 1 for some n in tu. Let A = (J C. Then 
A is the universe of a subalgebra 21 of & (=5^), since if F is an n-ary element of =5^, and 
(tfc : k < n) £ A n , then each tk is in some bk in C, and then 

\J{b k : k < n}U{Ft ---tn-i} G C, 

so Fto ■ ■ ■ t n -i is in A. 

The algebra 21 admits induction. Indeed, if not, then it has a proper subalgebra with 
universe B. An element of A \ B of minimal length has the form Fto ■ ■ ■ i n _i, where 
each tk must therefore belongs to B; but then Fto ■ ■ ■ t n -\ must also be in B. 

Immediately the operations F & have disjoint ranges. 
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Proper initial substrings of elements of A are not elements of A. Indeed, suppose this 
is so for every element of A that is shorter than the element Fto ■ ■ ■ t n -\. If Fto ■ ■ ■ i n _i 
has an initial segment that is in A, then this segment must have the form Fuq ■ ■ ■ u n -i, 
where one of to and uq is an initial segment of the other, so by inductive hypothesis they 
are both equal. Likewise, if, for some k in n, we have that ti and U{ are equal when i < k, 
then tk = Uk- By induction in n, we have Fto ■ ■ ■ t n -i = Fu$ ■ ■ ■ u n —\. By induction on 
lengths, proper initial segments of elements of A are not elements of A. 

Consequently each operation F 6 on A is injective. By Theorem [7] then, 21 does indeed 
admit recursion. □ 

The free algebra guaranteed by the theorem can be denoted by 

Tm (y); 

it comprises the closed terms of y in Lukasiewicz or 'Polish' notation [4, n. 91, 

p. 38]. For example, suppose = {0, s, t}, where is miliary, and s and t are singulary. 
It will be useful to note that the elements of Tm°(y) can be arranged in a tree: 




ssO tsO stO ttO 

In the signature {0, +}, the ordering of Tm°( I 5^) is more complicated: 



++0++000++0+00+00 




Again, it was because we already had cu as a class that we could obtain Tm (^) 
without working out a full analogue of Theorem [3] or [6) However, I wish now, in the 
remaining sections, to work out an analogue of cu itself in the signature y. 

8. Numbers, generalized 

If y is an arbitrary algebraic signature, we want to obtain a class w r y that is to cu as 
y is to {l,s}. We can consider cu to arise as follows. 
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One starts with the assumption that there is some simply infinite system (N, l,s). 
With some difficulty, one shows that there is a total ordering of N such that x < X s for 
all x in N; then one shows that N is well-ordered by <. One might for example use the 
method that Skolem preferred to avoid, as noted in § 

x < y 3z x + z = y. 

Regardless of Skolem's concerns, an infelicity in this definition is its dependence on 
addition, which in turn depends on whether one treats the initial element of N as 1 
in the usual sense. In any case, Now one has a function g on N as in (|xiiij) . so that 
g{x) = <?[pred(x)]. One then computes 

g(l) = g[0] g(x + 1) = fir[pred(x + 1)] (xvii) 

= 0, = g[pred(x) U {x}] 

= 3[pred(x)] U g[{x}} 

= g{x) u {g(x)}. 

Let x U {x} be denoted by 

x'. 

Thus g is a homomorphism from (N, l,s) to (V, 0,'), and the latter has a substructure 
with universe g\N] that admits induction. By the Foundation Axiom, the operation 
x i—T- x' on V is injective. Therefore g is an embedding, and (g[N],0,') is a simply 
infinite system. The ordering of g{N] induced from N by g is membership. By induction, 
each element of g[N] is transitive. One then defines ON to consist of the transitive 
sets that are well-ordered by membership. Then (ON,0,') is an iterative structure; 
moreover, ON is transitive and well-ordered by membership. An element of ON that 
neither is nor belongs to the image of x i— > x' is a limit. One defines cu as the class 
of all elements of ON that neither are limits nor contain limits. Then cu contains and 
is closed under x i— > x', but no proper subclass C of cu is the universe of a substructure 
of (cu,0,'), since the least element of cu \ C is either or else a' for some a in C. 
Therefore one has recovered g\N] as cu. In particular, there is no need to use the Peano 
Axioms to define an ordering that well-orders N; such an ordering is induced from cu. 

Again, as Skolem observed, the ordering of N can also be defined recursively, without 
reference to cu, by 

pred(l) = 0, pred(n + 1) = pred(n) U {n}. (xviii) 

We want similarly to define an ordering on the free algebra Tm°(y). Meanwhile, as 
a first generalization of (jxviij) . on Tm°(y) we have the height function, hgt, given 
recursively by 



hgt(Ft •••*„_!) = |J{hgt(t fc )U{hgt(t fc )}: k<n}. 



(xix) 
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By induction, hgt(x) is in cu, and hgt(Fto ■ ■ ■ i n -l) i s the greatest of the numbers 
hgt(ifc) + 1 (so it is if n = 0). In a generalization of (jxviiip . we can make the re- 
cursive definitions 



pred(Ft ■ ■ ■ i n -i) = [j{pred(t k ) U {t k } : k <n}, 
pred k (Ft ■ ■ ■ i n _i) : 



pred(t fc ) U {t k }, if&<n, 
0, if k > n. 



Immediately, 



pred(Ft • • • i n -i) = (J{pred fc (F£ ■ ■ ■ £ n -i) : k G cu}. (xx) 

Let us also write 

£ < u <J=^ i € pred(u), t < k u <^=^ i G pred fc (u). ( xx i) 

An ordering directs a class if every finite subset has an upper bound in the class with 
respect to the ordering. An ordering is well-founded on a class if every section of the 
class is a set and every subset of the class has a minimal element. 

Lemma 1. 0?( Tm°(y): 

(1) x < y if and only if x < k y for some k in cu; 

(2) if x < y and y < k z, then x < k z; 

(3) < and < k are well-founded orderings; 

(4) < directs pred fc (x); 

(5) tk is maximal with respect to < in pred fc (-Fio " " " tn-i), assuming k < n. 
Proof. Condition (jrj) is a restatement of (jxxj), and ([3]) is 

y < k z => pred(y) C pred fc (z), 

which is established by induction on z. Indeed, suppose the claim holds when z G 
{to, . . . ,t n -i}, but now y < Fto ■ ■ ■ t n _\. Then either y < k t k or y = t k for some k in n. 
In the former case, by inductive hypothesis and (jrj), pred(y) C pied k (t k ) C pred(tfc). 
Hence, in either case, 

pred(y) C pred(t fc ) C pred fc (Ft • ■ • 

so the claim holds when z = Fto ■ ■ ■ tn-i- 

By ([1]) and ([3]), the relations < and < k are transitive. To complete (gj), we observe by 
induction on u that 

v < u => hgt(f) G hgt(u). 
As G is irreflexive, so is <, and hence so is by (jlj). As G is well-founded, and the 
classes pred(y) are all sets, < and are well-founded. 

Finally, both ^ and ([5]) follow from the observation that, by definition, t k is the 
maximum element of pied k (Fto ■ ■ ■ t n -\) with respect to <, if k < n. □ 

Proving the lemma would take more work if the corresponding ordering G of cu were 
not available. One could use the ordering on N as defined by (|xviiij) : but proving directly 
that it is a well-ordering takes more work than proving the same for G on cu. In any 
case, we can now prove a generalization of Theorem [5} 

Theorem 9. An algebra in a signature & is free if and only if 
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(1) it has orderings < and < k for each k in w, with corresponding sets of predecessors 
as in (jxxip . such that 

(a) x < y if and only if ' x <k y for some k in w, 

(b) if x < y and y < k z, then x <k z, 

(c) < directs pred fc (x) , 

(d) Xk is a maximal element with respect to < of pred fc (i ? (xo, • • • , x n -i)), as- 
suming k < n, 

(2) the distinguished operations have disjoint ranges, and 

(3) one of the following: 

(a) the union of these ranges is the whole underlying class of the algebra, and < 
is well-founded, or 

(b) the algebra admits induction. 

Proof. If an algebra in 5? is free, then it is isomorphic to Tm°( t y), so follows from 
the lemma. Also (jjj) and ( jgaj ) follow from this and Theorem [7J 

Suppose 53 satisfies (jxj) , (jlj), and ( jgaj ), and 21 is a subalgebra. If B \ A is nonempty, 
then it has a minimal element of the form F(xq, . . . , x n _i); but then the Xk must be in 
A, and then so must F(xq, . . . , x n -\) be. Thus 23 = 21, so it satisfies ( febp . 

Finally, suppose 21 satisfies (Jrj), ([2]), and (3E). By Theorem [7) to conclude that 21 is free, 



it is enough to show that each is injective. By induction in 21, the homomorphism h 
from Tm°(y) to 21 is surjective. If some F^ is not injective, then there is some minimal 
Ft ■ ■ ■ tn-l in Tm°(y) for which 

F*(M*o), ■ • • , h{tn~i)) = h{Ft ■ ■ ■ tn-i) = h{Fu ■ ■ ■ u n ^) = F*(h{u ), h(u n ^)) 

for some Fuq ■ ■ ■ n n _i, although /i(tfc) 7^ h(uk) for some k. But h(tk) and h(uk) are 
maximal, and the set of them has an upper bound, in pred fc (/i(-Fio • • • i n _i)); hence they 
are equal. This contradiction shows 21 is free. □ 

Towards obtaining 0)y as desired, suppose x is an (n + l)-tuple (xo, • • • , x n ) for some 
n in CO. Let us say that the type of x is x n ; if k < n, let us say that the elements of x 
of grade k are the elements of Xj~. We may use the following notation: 

y £ k (x , . . . , x n ) <^=^ k < n & y G x k , 

j U( x fc : ^ < n }' if x = (xo, • • • , x n ) for some n in tu and some x^, 
1 0, otherwise, 

y G x <J=^ y G ^(x). 

Here @(x) is the set of 'graded' elements of x. Any elements of x n , as such, are left out. 

The notion of well-foundedness makes sense for arbitrary binary relations Q such as 
g'. Indeed, a binary relation R is well-founded on a class C if 

(1) the class {x : x G C & x i? a} is a set whenever a G C, and 

(2) if 6 is a nonempty subset of C, then 6 has a minimal element with respect to R, 
that is, an element d such that 

bn {x: x R d} = 0. 

By the axioms of Infinity and Choice, condition ^ is equivalent to 



2 ^This observation, with the ensuing definition, is apparently due to Zermelo [24} II. 5.1, p. 63]. 
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(2*) there is no sequence (a n : n G cu) of elements of C such that a n+ \ R a n for each 
n in cu. 

In particular, well-founded relations are irreflexive. The Foundation Axiom is just that 
membership is well-founded on V. If R is well-founded on C, then (C, R) admits 
induction in the sense that the only subclass D of C for which 

C D{x: x Ra} C D a£fl 

for all a in C is C itself. 

Theorem 10. The relation G' is well-founded on V. 
Proof. Using the definition (jxij) for ordered pairs, we have 

(x ,...,x n ) = |{{0},{0,x }},...,{{n},{n,x n }}|, 

(J (J(x , • • • , x n ) = {0, . . . , n,x , ■ ■ ■ , x n }, 

so {y : y G 7 x} C (J (J |J x, a set. Suppose there were a sequence (x n : n G cu) such that 
always x n +i G' x n . Then always x n+ \ G UUU 2 -"- But then, assuming x n G R(a n ), we 
should have also |J|J(Jx n G R(a n ), and then x n+ \ G R(a n +i) for some a n +i that was 
strictly less than a n . There is no such sequence (a n : n G cu) of ordinals. □ 

For a better generalization of (jxviip than (jxixjl . we make V into an oS^-algebra by 
defining 

F V (x , . . . , X n _l) = (SP(z„) U {x }, ^(x n _i) U F) (xxii) 

for all F in S*. 

Theorem 11. The operations F v are injective and have disjoint ranges. 

Proof. That the ranges are disjoint is immediate from the definitions. If 

F v (x , . . . , x n -i) = F v (y , y n -i), 

but Xk 7^ yk for some k, then, since 

&(x k )U{xk} =^(y fc )U{y fc }, 

we have x/% G @(yk) an d G ^(x&), that is, x/c g' and g' Xk- But this contradicts 
Theorem \TU\ □ 

Considering V as an ^-algebra, we can now define 

as the homomorphic image of Tm°(y) in V. Then ujy is free by Theorems [7] and QTJ 
But I propose to obtain cu^' alternatively as a certain subclass of a class ONy, just as 
cu is obtained from ON. 
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Let us define 
so that 

and 



9. Ordinals, generalized 
pred fc (x) = {y: y G fc x}, 

pred fc ((x , . . .,x n )) = 



if k < n, 
otherwise, 



&(x) = ^J{pred fc (x) : k G tu}. 
Let us say that x is fc-transitive if 

y G pred fc (x) => ^(y) C pred fc (x). 
In order to define ONy, we first define a class Dy, which comprises all x such that 

(1) x is an (n + l)-tuple having the type of an n-ary element of 5? for some n in tu, 
and then pred fc (x) 7^ whenever k < n; 

(2) each element y of (x) is an (m + l)-tuple having the type of an m-ary element 
of r y for some m in tu, and then pred^(y) 7^ whenever I < m; 

(3) x is fc-transitive for each k in tu; 

(4) each element of &(x) is fc-transitive for each k in cu. 

We shall presently define ONy as a subclass of Dy. Meanwhile, we already have some 
analogues to properties of ON. 

Lemma 2. The relation G' is transitive on Dy. 

Proof. If Dy contains x, y, and z, where y G' x and z G' y, then y G 5^(x), so y G 
pred fc (x) for some fc, and therefore Sf(y) C pred fc (x) C <S(x) by Q; but also z G 5^(y), 
so z G ^(x), that is, z £' x. □ 

Also, like ON itself, Dy has a kind of transitivity: 

Lemma 3. If x G Dy, £/ien £f(x) C Dy. 

Proof. Suppose x G Dy and y G ^(x). Then y satisfies (JTJ) , by for x. Also y 
satisfies Q, by (gj) for x. Finally, y G pred fc (x) for some A; in tu, so by Q for x we have 
^(y) Q pred fc (x), hence 2f(y) C ?^(x). So y satisfies ([3]) and (gj). Therefore y G Dy. □ 

When y = {l,s}, then Dy is not really anything new. Indeed, informally, 

D {1 , s } = {(l),({(l)},s),({(l),({(l)},s)},s),...}. 

Using the definition in (jxxiip . we have: 

Theorem 12. (D {ljS} , l v , s v ) = (ON, 0,a^a + l). 

Proof. Note that l v is the 1-tuple (1). Fr om ON to V, there is a function H defined 
recursively by 

H / )= f(l), ifa = 0; 

1 ' |(ff[a],s), if a ^0. 

Then 

9{H(a)) = pred (H(a)) = H[a}. (xxiii) 
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Therefore 

H(a + 1) = (if [a] U {if (a)}, s) = (&(H(a)) U {H(a)}, s) = s v (ff(a)), 

which means if is a homomorphism of {1, s}- algebras. If a < /3, then H(a) G H[/3], so 
if (a) &' H(/3) by (Ixxiiip . Therefore if is order-preserving and, in particular, injective. 
It remains to show that the range of if is Drj a. 

In the definition of D{ l s j, conditions and hold for elements of if [ON] by 
definition of if. Then Q follows; that is, if (/?) is always fc-transitive. Indeed, it is 
trivially so, if j3 = or k > 0; and by (jxxiiip . we have that, if x G pred (if (/?)), then 
x = if (a) for some a, so 

9(x) = H[a] C H\fi] = pred (ff (/3)). 

Since each element of ^(if(/3)) is some H(a), it is /c-transitive too: thus Q. So H 
maps ON into D| l s |. 

By induction in D{i, s } with respect to the well-founded relation g', we establish 
D{i jS } ^ if [ON]. Suppose x G Dr ls \. Then &(x) C D| ls | by the lemma. As an 
inductive hypothesis, suppose C if [ON]. If ff(/3) G 5f (x), and a < (3, then, since 
ff(a) G &(H(J3)) by (jxxiiip . we have ff(a) G ^(ac) by the O-transitivity of x. Thus 
{a: H(a) G §f(x)} is transitive, so it is an ordinal f3, and H[f3] = <S{x). If /3 = 0, then 
x = (1) = H(f3) by the second part of condition ([1]) in the definition of D.n s \; if /3 7^ 0, 
then ^(x) 7^ 0, so x can only be (^(x),s), which is also H((3). So if is an isomorphism 
between ON and D/ l s i. □ 

An element x of Dy can be called a limit if some set pred fc (x) has no maximal element 
with respect to G ; . In general, there may be x in D y such that neither x nor any element 
of @(x) is a limit, but still x is not in tuy. Indeed, if 5? = {a, 6, s}, where a and 6 are 
miliary, and s is singulary as usual, then D{i, s } contains ({(a), (6)},sj, which is not in 
Wy . So we let 

denote the class of x in T)y that meet the additional conditions 

(5) G' directs each set pred fc (x); 

(6) g' directs each set pred^(y) for each y in &(x). 

We already know from Theorem \W\ that G' is well-founded on ONy and its elements; 
but this relies on the axioms of Foundation, Infinity, and Choice. To avoid this reliance, 
we can impose one additional condition on the elements x of ONy: 

(7) G' is well-founded on <S{x). 

As will be shown with Theorem fi6lbelow. this definition of ONy is enough to ensure that 
distinguishing wy as a subclass of ONy is analogous to distinguishing cu as a subclass 
of ON. Meanwhile, we can work out a particular example, as in the next section. 

10. Trees 

Suppose 5? = {0,s,t}, where is miliary, and s and t are singulary. 

Let x G ONy. Then pred (x) = Sf(x). Consequently, &(x) is well-ordered by g'. 
Indeed, to establish this, by Lemma [2] and ([7]), it is enough to note that G' is total on 
&(x). Suppose on the contrary that has incomparable elements y and z. Then 
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y, z g' ao for some ao in &(x), by Q. But then y, z G' a\ for some a\ in ^(ao), by ©. 
But ai € @(x) by (gj); so y, z G ; a% for some 02 in (ai), and so on. This violates (jjj). 
Since each 

11. Numbers as ordinals 
Theorem 13. The relation g' is a well-founded ordering o/ONy. 

Proof. By Lemmas g' is transitive on ONy. If x G ONy, then, because G 7 is well- 
founded on £f(x) by Q, in particular x ^ x. Therefore g' orders ONy. Also {y: y G 
ONy & y G 7 x} is a set, being a subclass of Sf(x). Suppose 6 C ONy and x G b. If 
x is not a minimal element of 6 with respect to G', then b n {y: y G 7 x}, being a subset 
of §f(a:), has a minimal element; but this minimal element is minimal in b as well, by 
transitivity of g'. Therefore G' is well-founded on ONy. □ 

Theorem 14. If x £ ONy, then Sf(x) C ONy. 

Proof. Suppose x G ONy and y G Sf(x). By LemmagJ we know y G Dy. By © for x, 
we know y satisfies Q; by Q also, y satisfies ([U]). Finally, since 5f(y) C 3f (x) by Q, we 
have (j7j) for y. □ 

Theorem 15. ONy is an 5? '-subalgebra o/V. 

Proof. Suppose Xfc G ONy when k < n, and let y = -F v (xo, • • • , x n _i). We show that y 
satisfies the defining conditions of ONy. 

1. First, y satisfies ((Tj) by definition, that is, (jxxiip . 

2. Suppose z G 5^(y); equivalently, z G pred fc (y) for some k in n. But 

pred fc (y) = &(x k ) U {x fc }, (xxiv) 

so in particular either z = x k or z G Sf(xfc). By (jrj) and respectively for x/%, we have 
that y satisfies (j^j). 

3. Also, if z = Xfc, then 

Sf(z) = Sf(x A ) C pred fc (y), 
while if z G £^(xfc), then z G pred^(xfc) for some £, so that 

S?(z) C pred £ (x fc ) C &(x k ) Q pred fc (y) 

by Q for Xfc and (jxxivp . Thus y satisfies Q. 

4. If z = Xfc, then z is ^-transitive for each £ in cu by (M for Xfc, while if z G 5f(xfc), 
then z is ^-transitive for each I in cu by faj) for Xfc. Therefore y satisfies fort, and so 
y G Dy. 

5. By (jxxivp . we have that Xfc is the greatest element of pred fc (y) with respect to G', 
so this ordering directs predfc(y). Thus y satisfies Q. 

6. If z = Xfc, then g' directs pred^(z) by fej) for Xfc, while if z G Sf(xfc), then g' directs 
pred^(z) by (|6|) for x\~. Therefore y satisfies ([6]). 

7. We have 5f(y) = \}\^S{xk) U {^fc}: & < n }- By Theorems [T3I and pnfl since g' is 
well-founded on each 5f (x^), it is also well-founded on each set ^(xfc) U {xfc} and hence 
on their union. So y is in ONy. □ 

Now cu y can be understood as the image of Tm°(y) in ONy. But again, we want 
to obtain tuy independently from Tm°(y). 
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Theorem 16. Wy consists of those x in ONy such that neither x nor any element of 
'Six) is a limit. 

Proof. By Theorem [13J we can argue by induction. Suppose Xk meets the conditions 
when k < n. Let z = F (xo, . . . , x n -\). Each Xk is a maximal element of pred fc (z), so z 
is not a limit. Each element of (2) is either Xk or an element of c S{x\^) for some k in n, 
so it is not a limit either. Thus z meets the conditions. 

To prove the converse, suppose if possible that (xq, . . . ,x n -i,F) is a counterexample 
that is minimal with respect to G ; . Then each of the x^ has a maximal element, Uk, with 
respect to G ; . But g' directs X/., so Uk is the greatest element, hence x^ C Sf(itfc) U 
Also &(uk) C Xfc by the /c-transitivity of {xq, . . . , x n -\, F). Thus 

(x , . . .,x n -x,F) = (&(uq) U {u }, . . . ,^{un-x) U {n„,_i},F) = F v (u , . . . ,u n -\). 

But also, neither Uk nor any element of 3f is a limit, so Ufc G iu,y by minimality of 
(xq, . . . , x n ~i, F), and therefore F v (uq, • • • , u n -i) G to^«. □ 
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